Ultra-wide Baseline Aerial Imagery Matching in Urban Environments
نویسندگان
چکیده
Correspondence matching is a fundamental problem in computer vision, having many uses in structure from motion, stereo vision, image registration, pose-estimation, and others. Today, a large amount of aerial imagery is available online via mapping services such as Google Maps. If we were to pick any pair of images from two different aerial views (see Figure 1 for an example), and perform SIFT-based [3] correspondence matching, we would find ourselves with a large number of mismatches due to the large distortions between the images. Even when augmenting these methods with typical robust approaches such as RANSAC [1] and its variants, we would still fail at finding correct correspondences since RANSAC has difficulty calculating the correct model without a large ratio of correct matches to outliers. These difficulties – large distortions, and low ratio of correct matches to outliers – together render traditional methods ineffective. In this paper, we consider the problem of correspondence matching for aerial imagery in urban environments. Our approach builds on multiple ideas in the literature. Namely, A-SIFT [7], patch-based methods [6], Generalized RANSAC framework [8], self-similarity [5], graphbased image matching [2], and geometric-invariance [4]. The main idea behind this work is to combine view-synthesis with multiple point correspondences under a RANSAC-based scheme. Robust model estimation is supported by self-similarity principles and graph-based modeling that drives the sampling process in a restricted manner that allows the correct model to be extracted. Each of these ideas was chosen to deal with specific problems that cause failures in the earlier approaches as we will now briefly describe. An overview of our pipeline is shown in Figure 2. We begin by extracting square patches around detected Harris corners, which we then describe using the Histogram of Oriented Gradients. Our model assumes affine distortions, which leads us to synthesizing transformations to account for probable changes between the two images. We apply our transformations to one of the input pairs only, as follows: Scale = Sx 0 0 0 Sy 0 0 0 1 Shear = 1 Shx 0 Shy 1 0 0 0 1 Rotation = cos(θ) −sin(θ) 0 sin(θ) cos(θ) 0 0 0 1
منابع مشابه
Provide a Deep Convolutional Neural Network Optimized with Morphological Filters to Map Trees in Urban Environments Using Aerial Imagery
Today, we cannot ignore the role of trees in the quality of human life, so that the earth is inconceivable for humans without the presence of trees. In addition to their natural role, urban trees are also very important in terms of visual beauty. Aerial imagery using unmanned platforms with very high spatial resolution is available today. Convolutional neural networks based deep learning method...
متن کاملA System for the Generation of Synthetic Wide Area Aerial Surveillance Imagery
The development, benchmarking and validation of aerial Persistent Surveillance (PS) algorithms requires access to specialist Wide Area Aerial Surveillance (WAAS) datasets. Such datasets are difficult to obtain and are often extremely large both in spatial resolution and temporal duration. This paper outlines an approach to the simulation of complex urban environments and demonstrates the viabil...
متن کاملTracking Ground Based Targets in Aerial Video with Dual-Tree Complex Wavelet Polar Matching and Particle Filtering
The detection and tracking of targets in aerial imagery of cluttered urban environments is addressed. Polar matching, using dual-tree complex wavelet transforms, is used as a shift and rotation invariant detector. A particle filter is employed to add robustness, especially in the event of target occlusion. We show that, together, these methods can robustly track a ground based target as it beco...
متن کاملImage-based 3D Data Capture in Urban Scenarios
Presuming that airborne imagery is available at a sufficient overlap, state-of-the-art multi-stereo matching can generate DSM raster representations at an accuracy and resolution which corresponds to the ground sampling distance (GSD) of the original images. For such secenarios recent matching software exploits the resulting redundancy and derives surface representations at a remarkable accurac...
متن کاملVehicle Detection and Orientation Estimation Using the Radon Transform
Determining the location and orientation of vehicles in satellite and airborne imagery is a challenging task given the density of cars and other vehicles as well as the complexity of the environment in urban regions around the world. We describe an accurate and flexible method for detecting vehicles using a template-based directional chamfer matching approach, and vehicle orientation estimation...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013